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Abstract 

The Cramer-Granville conjecture is an upper bound on prime gaps, gn = Pn+i — Pn < c log^ pn 
for some constant c > 1. Using a formula of Selberg, we first prove the weaker summed version: 
'n,n=i9n < Pn- In the remainder of the paper we investigate which properties of the 

fluctuations f(x) = 7r(x) — Li(x) would imply the Cramer-Granville conjecture is true and present 
two such properties, one of which assumes the Riemann Hypothesis; however we are unable to 
prove these properties are indeed satisfied. We argue that the conjecture is related to the enormity 
of the Skewes number. 


* andre.leclair@gmail.com 


1 



Let pn denote the n-th prime number with pi = 2, and dehne the gaps 


9n — Vn+l Pn (1) 

The lowest gap is obviously equal to 2, however an interesting question is how often this 
minimal gap occurs, and the twin primes conjecture says an inhnite number of times. There 
has recently been progress on this problem by Yitang Zhang [1]. In the other direction, 
upper bounds on gaps are also of interest. In practice the latter are more important since, 
given knowledge of a prime, they can aid in the location of the next prime. 

The Prime Number Theorem leads to Pn ~ nlogn, which implies the average gap Qn is 
log n. However it is known that the maximal gaps grow faster than this [2] : 

1 . Pn+l Pn 

lim sup — - - = cxD 

n^oo log Pn 

The Cramer-Granville conjecture m is the statement 

gn<c\og^Pn, Vn (2) 

for some constant c > 1. Cramer’s model supports the conjecture with c = 1, whereas 
Granville proposed that c > 1 and suggested c > 2e~'^ ~ 1.1229.... There is extensive 
numerical evidence for the conjecture [Sj. Furthermore, thus far this evidence supports the 
value c = 1 at least for n > 4: the greatest known value of the ratio Qn/ Pn is 0.9206386... 
for the prime 1693182318746371, which is somewhere around the n = 5 x 10^^-th prime. 

As usual let 7t{x) denote the number of primes less than or equal to x. Riemann derived 
an explicit formula for 7r{x) in terms of an inhnite sum over zeros of C(s) inside the critical 
strip 0 > 3fJ(s) < 1. The Prime Number Theorem (PNT) is the result that the leading term 
is 



It was proven independently by Hadamard and de la Vallee Poussin using Riemann’s formula 
for 7i{x) and showing that there are no zeros with 3fJ(s) = 1. 

Cramer’s original conjecture is essentially based on the PNT. The Cramer model is a 
probabilistic model of the primes, in which one assumes that the probability of a natural 
number of size x being prime is 1/log x. Cramer proved that in this model, the conjecture 
holds true with probability one. Since there is a great deal of numerical evidence the conjec¬ 
ture is correct, this suggests that a proof of it may only require something slightly stronger 
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than the PNT, and this will what we will investigate here. A resnlt of this kind is dne to 
Selberg [U]. The latter led to an independent, so-called elementary proof of the PNT, in 
that it does not rely on Riemann’s formula for 7r{x). The starting point for this article is a 
formula of Selberg, namely Theorem 1 below. 

The Cramer-Granville conjecture may be difficult, or even impossible, to prove for a 
single given n. On the other hand, if it is known to be true by direct computation for all 
n < Nq for some high enough Nq, one can attempt to prove the conjecture with a bootstrap 
principle that extrapolates to inhnity using some asymptotic formulas valid in the limit of 
large N. The asymptotic formulas must be such that the relative fluctuations decrease fast 
enough not to spoil the validity of the conjecture. This is in analogy to statistical physics of 
a system of N particles, where the relative fluctuations are typically of order l/y/N so that 
results just get better and better as one increases N, i.e. in the so-called thermodynamic 
limit. 

In the sequel, relative fluctuations will be quantihed as follows. Since Li(a;) is the leading 
term, 

7i{x) = Li(a;) -F f(a;) (4) 

where the fluctuating term f(a:) grows more slowly than the smooth part Li(x) 
i.e. hma,_^oo f( 2 ^)/Li(a;) = 0. We will consider different bounds on f(a;) depending 
one assumes the Riemann Hypothesis or not. 

We begin with: 

Theorem 1. (Selberg) 

2x log X + 0{x) = log ^p + logplogg (5) 

p<x p^q with pq<x 

where p, q are primes. 

Let us hrst make some simple observations based on the above theorem. Let Si (x) denote 
the hrst sum on the RHS of the above equation and S 2 {x) the second sum. Si includes terms 
up to log^ x, whereas S 2 contains terms with p = g up to log^ -y/a?, which are much smaller. 
There are additional terms in S 2 compared to Si for p > q, where the largest prime is 


~ x/ logo:, 
on whether 
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approximately x/2 corresponding to the term log(x/2) log 2, and they are also considerably 
smaller. This strongly suggests that Si{x) > 5*2(x). One can easily check numerically that 
for all X < piQi = 104729, Si{x) is signihcantly larger than S 2 {x) and the difference increases 
with X. For instance, for x = pio^, Si{x) — S 2 {x) = 686787.25... For larger x, the difference 
Si — S 2 only continues to grow. These considerations lead us to formulate the following 
lemma: 


Lemma 1. 

^ log p log g < ^log^p (6) 

p,q with pq<x 

Proof. For x < pjsi^ where Nq = 10^ for instance or even much smaller, the above inequality 
is easily verihed by direct computation. For higher x, the relative fluctuations of Si and S 2 
are very small compared to the difference Si — S 2 . Thus, to go to higher x, one can use the 
following asymptotic formulas: 


^log^p ~ 

p<x 

logplogg ~ 

p,q with pq<x 



X log X — X — 2 log 2 + 2 
[log t log u] = X log X — (2 + log 2)x + 4 


(7) 


Therefore S'i(x) — S' 2 (x) ~ (1 + log2)x for large x, and this linear growth in x is much larger 
than the fluctuations for large enough x. □ 


Theorem 2. For large enough N > Nq for some finite Nq, one has 

N N 

< '^iog^Pn ( 8 ) 

n=l n=l 

Proof. From Lemma 1 and Theorem 1, one has 

xlogx + 0(x) < E log^p (9) 

p<x 

Let X = Pn +1 — e with e small and positive, and consider the limit e —)■ 0. Then (|^ can be 
expressed as 

N 

P7V+llogp7v+l + 0(p7V+l) < ^log^Pn (10) 

n=l 
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It should be kept in mind that the 0{pn+i) term can be negative here. Nevertheless, for 
large enough N, p^+i < Pn+i logpjv+i + 0{pn+i). Next, noting that pn+i = En=i fi'n + 2 
proves the theorem. □ 

Theorem 2 is consistent with the Cramer-Granville conjecture, and suggests c = 1, al¬ 
though it is also consistent with c > 1 since the fluctuations in the sum could just average 
out to give c = 1. Furthermore it is clearly not enough to establish it since it does not im¬ 
ply that each individual term in the sum satishes the inequality. For the remainder of this 
article, we propose conditions on the fluctuations which would imply the Cramer-Granville 
conjecture, however we are unable to prove these conditions are true. 


Proposition 1. Define 




p<x 


s(p) 


( 11 ) 


where g{pn) = Qn- Then if A{x) with c > 1 is a monotonically increasing function of x in 
a region xi < x < X 2 then the Cramer-Granville conjecture is true for all primes p in the 
region xi < p < X 2 . 


Proof. Dehne the discrete function of N 

N 

n=l 

Now D{N) may be viewed as the above function A of Pn+i, i-e. D{N) = A^pn+i). Since 
Pn +1 is a monotonically increasing function of N, then if A{p) is a monotonically increasing 
function of p for large enough p, then D{N) is a monotonically increasing function of N 
for large enough N > Nq. This in turn implies that for N > Nq, at each step in the sum, 
TV — 1 —)■ A, one has clog^PAr > qn- □ 


Proposition 2. 


A{x) 


where 


for some constant B ^ 5. 


X log a; 


(c+1) 

c 


X -P t){x) + 0(1) 


Bx 
log a; 


< I)(a;) < 


Bx 
log a; 


(13) 

(14) 
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Proof. One has the exact formula 


px 

^log^p= / h7r(t)log^t 

p<x “^2 


(15) 


It is known that [ZllSI 


X X 1.8a; , , x x 2.51a; 

+ :—^ ^ < 7r(a;) < ;-h :—^ + 


(16) 


log a; ' log^a; log^a; log a; log^a; log^a; 

for large enough x; the hrst inequality requires x > 32299 and the second x > 355991. Thus 
we can simply write this as 

' X 


, . X X 2x ^ 

Tl(x) = -h - 9 -h :- 5 -h O 


log X 


(17) 


log a: ' log^a; log^x 

where the hrst three terms come from the expansion of Li(x). The bound dl?] ) is only valid 
for large enough x. However by changing the constants one can obtain a bound valid for all 
x: 


7r(x) = :r—-h 


where 


log a; ' log^a; ' log^x 
Bx 


log^x 


< f(a;) < 


2 x , 

, 3 + f(^) 

log X 

(18) 

Bx, 

log^ X 

(19) 


for some constant B k, h. We determined this value of B by verifying (19) is valid for x 


below the value where (17) becomes valid. Integrating (15) by parts, this implies (14). □ 


Dehne the function h{x) such that 


where —B < b{x) < B. One has 


^(x) = 


&(x)x 

logx 


dA , 1 b(x) 

— = logx-h ^- 

dx c log X 


+ 


b'{x)x 


( 20 ) 


( 21 ) 


log X / log X 

where b'{x) is formally db{x)/dx. Due to the jump discontinuities of b{x) where the derivative 
is not dehned, one needs to formally dehne b'{x). On a prime Pn let us dehne it as follows: 

b{pn+i) - b{pn) 


b\Pn) = 


( 22 ) 


Vn-\-l Pn 

We then dehne b'{x) for other x as a linear interpolation between b'{pn), n = 1,2, 3. 

Recall that based on Proposition]^ we wish to show that dA{x)/dx > 0 for x > xq for 
some hnite Xq. Let us hrst provide some evidence that the b' term in (|2T|) can be neglected, 
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i.e. does not spoil the monotonicity of A{x) at high x. To this end, let us first assume b' = 0 
so that b is constant. Then as far as monotonicity goes, in the worse case b{x) = —B, and 
A (a:) would be monotonically increasing for 

X > eV2c+Vi/4c2+-B ^23) 


Note that for a given B, the above equation does not single out any particularly special value 
of c. However, if, for instance, one wishes the Cramer-Granville conjecture to be true beyond 


the hrst 5 primes, then c ~ 1, since for B = 5, (23) gives x > 16.3 > p^. Interestingly, the 
original Cramer conjecture with c = 1 is only known to fail for the hrst 4 primes, and beyond 
this has been verihed numerically to very high N > 10^° or so. If we chose a smaller value 
of c, then A{x) simply becomes monotonically increasing beyond a higher value of x. The 
fact that this analysis led to the very likely correct prediction that the Cramer-Granville 
conjecture is valid for primes p > ps, suggests that the b'{x) must indeed be very small. As 
we discuss below, this appears to be related to the enormous value of the Skewes number. 
This analysis also suggests the value of c is hxed by the gaps in the low primes. 

One can state something more precise as follows. Only if b'{x) < 0 can it spoil the 
monotonicity. If the following condition holds 


b\p) > 


log^p 

p 


clogp 


(24) 


where p is prime with p > xq for some Xq, then the Cramer-Granville conjecture is true for 
p > Xq. Numerically, we checked that the above condition is valid for 5 < p < 10^^ with 
c = 1, however we cannot prove that it is valid beyond this. 

Stronger bounds on the fluctuations do not significantly improve the analysis. Since it is 
believed that the strongest bounds come from assuming the Riemann Hypothesis, let us do 
so. Schoenfeld showed [9] that 


|f(a;)| = |7r(a;) — Li(x)| < Ky/x\ogx 


(25) 


for X > 2657 where K = l/Svr. By increasing K, one can make the above bound valid for 
all X] we found K ^ 1/3. Repeating the above steps leads to 

A(a;) = xloga: — -I--I- 0(1) (26) 

c 

where 

l)(a;) = k{x)^/x\o^ X (27) 
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k'(x) 



FIG. 1: k'{x) evaluated at primes. The smooth curve is the RHS of (30) with c = 1. 


From (25) one has 


-K < k{x) < K 


(28) 


In (27) we have dropped a few terms with lower powers of logx, such as \/x log^ x since they 


will not signihcantly change our conclusions. 

Dehne the formal derivative k'{x) of k{x) as in (22) with b ^ k. In Figurej^we plot some 
values. We then have 


dA(x) , 1 k(x)\ogx 3 

-;- = logo: -1- ^- h k {x)y/x\og^ X 

ax c \ X 


(29) 

As before, if one hrst assumes the k' term is negligibly small, then the worse case is /c = —K. 
One hnds that for c = 1 and K = 1/3, dA{x)/dx > 0 for (c > p 2 - Thus, once again we hnd 
that neglecting the k' term apparently leads to the right conclusion, i.e. that the Cramer- 
Granville conjecture is true with c = 1 for all primes p > p^. In particular, if the following 
condition holds 


k'{p) > 


1 - 


(30) 


y/plog^p V* clogp^ 

for p > xo then the Cramer-Granville conjecture holds for primes p > xq. We checked 
numerically that the above condition with c = 1 is satished for all primes p > 3 up to 10^^, 
however again we cannot prove this result. See Figure [T] 


Additional evidence for k'{x) being very small comes from the largeness of the Skewes 
number. Up to values of x that are within reach numerically, Li(x) > 7r(x), and for a long 
time it was believed that this persists to inhnity. Let Ski denote the Skewes number, which 
is the hrst x where the crossover Li(x) < 7r(x) occurs. Littlewood proved Ski exists, i.e. 
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isn’t infinite, and Skewes first estimated it as Ski < . This has since been reduced 

to Ski < 10^^® [To]. Numerically it is also known that Ski > 10^^. Now k{x) starts out 
negative with k{x) > —l/Svr for low x, then very slowly increases on average until it hnally 
changes sign at Ski. Thus the average of k'{x) in the region 2 < x < Ski is approximately 
(SvrSki)”^, which is exceedingly small. 

It is interesting to try and determine the Skewes number from the smallnes of k'{x). 
Numerically we hnd that k{x) = 0(log log logo:), so that k'{x) is indeed small at large x. In 
particular, For x < 10^° a reasonably good £t is that on average 

k{x) ^ —A{a — log log log a:) (31) 

with a ~ 1.3. If this approximation persists, then Ski = e® . This is obviously very 
sensitive to a. For a = 1.3, Ski = 10^^, however a small increase to a = 1.5 would already 
give Ski = 10^®. Any values a > 2 are already ruled out since for a = 2, Ski > 10^°^. 
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